Biological Data Extraction and Integration — A Research Area Background Study

نویسنده

  • Cui Tao
چکیده

My research field is highly diverse. It interweaves many different areas in information technology and bioinformatics. The system I propose to implement can automatically locate, understand, and extract online biological data independent of the source and also make it available for Semantic web agents. This research field requires background knowledge from (1) Information Extraction, (2) Schema Matching, (3) the Semantic web, (4) Data Integration, and (5) Bioinformatics. 1 Information Extraction Currently, with the fast development of the internet, both the amount of useful data and the number of web sites are growing rapidly. The web is becoming an increasingly useful information tool for computer users. However, there are so many web pages that no human being can traverse all of them to obtain the information needed. Even in the narrow domain of molecular biological data, no human can traverse all the pages that may be of interest for finding needed information. A system that can allow users to query web pages like a database is becoming increasingly desirable. One possible strategy is to extract useful information from different web pages to populate databases for further handling. I survey information extraction techniques in the following three categories: (1) traditional information extraction; (2) hidden web crawling; and (3) biological data extraction. • Traditional information extraction. For traditional information extraction, I present five major data extraction tools. Each tool represents a different major way of doing information extraction. Lixto [BFG01] is a tool for supervised wrapper generation and automated web information extraction. It generates wrappers semi-automatically and interactively by creating patterns in a hierarchical order. The user can define extraction patterns

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integration of Deep Learning Algorithms and Bilateral Filters with the Purpose of Building Extraction from Mono Optical Aerial Imagery

The problem of extracting the building from mono optical aerial imagery with high spatial resolution is always considered as an important challenge to prepare the maps. The goal of the current research is to take advantage of the semantic segmentation of mono optical aerial imagery to extract the building which is realized based on the combination of deep convolutional neural networks (DCNN) an...

متن کامل

Integration of Remote Sensing and the GIS-based Methods for Provision of Cadastral Mapping of Agricultural Areas of Ardakan City

In the fifth development plan establishment, establishment of the Cadastre System of agriculture nationwide has been defined to be the work priority of institutions and organizations responsible in the area of agriculture and equity issuance in the country. In this study, the possibility of provision of the Cadastral mapping of agriculture by a integration of the data of the remote sensing and ...

متن کامل

Integration of Remote Sensing and the GIS-based Methods for Provision of Cadastral Mapping of Agricultural Areas of Ardakan City

In the fifth development plan establishment, establishment of the Cadastre System of agriculture nationwide has been defined to be the work priority of institutions and organizations responsible in the area of agriculture and equity issuance in the country. In this study, the possibility of provision of the Cadastral mapping of agriculture by a integration of the data of the remote sensing and ...

متن کامل

Investigation of Allelopathic Effects of Medicago Sativa Aqueous Root Extract on Germination and Seedling Growth of Trifolium Sp and Beta Vulgaris (Case Study: Zarch Area In Yazd Province, Iran)

In natural ecosystems, biological relationships have serious effects on system stability. Allelopathic is negative or positive effect of a plant on other plant growth. Today this word is used for rangelands, same as agricultural areas. Because of good herbage production, Medicago is most common plant in agricultural rotations and rangelands. Therefore investigation of its allelopathic qualities...

متن کامل

Investigation of Allelopathic Effects of Medicago Sativa Aqueous Root Extract on Germination and Seedling Growth of Trifolium Sp and Beta Vulgaris (Case Study: Zarch Area In Yazd Province, Iran)

In natural ecosystems, biological relationships have serious effects on system stability. Allelopathic is negative or positive effect of a plant on other plant growth. Today this word is used for rangelands, same as agricultural areas. Because of good herbage production, Medicago is most common plant in agricultural rotations and rangelands. Therefore investigation of its allelopathic qualities...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005